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Abstract 

We address the expected supremum of a linear combination of shifts of the sine kernel 
with random coefficients. When the coefficients are Gaussian, the expected supremum is 
of order yTogre, where n is the number of shifts. When the coefficients are uniformly 
bounded, the expected supremum is of order log log n. This is a noteworthy difference 
to orthonormal functions on the unit interval, where the expected supremum is of order 
\J n log n for all reasonable coefficient statistics. 
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1 Introduction 

Perhaps the most fundamental functions in signal processing are shifts of the sine kernel 

sin-7rt 



This kernel decays slowly in time, and consequently it is generally not used in prac- 
tice. Nonetheless, it is the starting point, certainly historically, for much of signal processing, 
information theory and sampling theory. If each shifted kernel has a random coefficient, it 
is natural to investigate the properties of the resulting signal. Here we address the expected 
supremum of such a signal. We let {a^}^ be independent random variables and consider the 
quantity 



sup 



sin7r(t — k) 



- 7T(t - k) 



(1) 
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We investigate the behavior of the peak when the {dk}kLi are Gaussian and symmetric ±1 
random variables, and show that in the Gaussian case E,S n ~ yf\ogn while in the ±1 case 
~ES n ~ log log n. This result is fundamental enough to be relevant in numerous settings. One 
example is when coefficients are quantized and the {a fc }^ 1 represent the difference between 
an actual coefficient and its quantized value. Another is when the coefficients are viewed as 
carrying information, and one is concerned with the peak value of the signal. We discuss this 
briefly below. 

2 Problem Formulation and Main Result 

We compare a signal of the type given in (OQ) with linear combinations of orthonormal functions 
on the unit interval. Here, the fundamental theorem, due to Kashin and Tzafriri flU, states that 
1 are uniformly bounded (i.e. in || ■ W^) orthonormal functions on [0, 1] and {a/ c }^ =1 
are independent symmetric random variables with a uniform bound on the third moment, then 



Thus, for uniformly bounded functions on the unit interval, the necessary linear combinations 
occur and result in Gauss-like behavior. Consequently, the statement is not sensitive to the dis- 
tribution of the individual coefficients. Note, though, that the uniform bound on the functions 
{4>k}kLi is essential to the result. 

The theorem just stated applies to two systems of practical importance, namely the Fourier 
and Walsh systems, which are known in electrical engineering as OFDM and CDMA sys- 
tems. Here the motivation for understanding the behavior of a signal's peak is that amplifiers 
particularly distort or eliminate the peak. This has led to extensive research in communica- 
tions engineering on what is called the peak-to-average power ratio. See the book for an 
overview of this area for OFDM (Fourier) and [2] for recent work on the CDMA (Walsh) case. 

Here we address similar questions for the shifted sine kernel on the real line. We make 
several introductory observations about the equation (OQ) before formally stating the problem. 
Note that when t — I, I an integer, the value of the function inside the absolute value bars 
equals a h so that max!< fc < n \a k | is an a priori lower bound on <S n . Thus, a first point of interest 
is to compare the signal's peak off the set of integers to that at the integers. A second point 
is to compare the peak behavior when the random coefficients are ±1 random variables and 
when they are Gaussian variables. For example, the simple lower bound max!< fc < n \a^\ does 
not grow in n for ±1 random variables. (Uniformly bounded, zero-mean random variables will 
be shown to behave the same as random ±1, and so we discuss only the latter at this point.) In 
such a linear combination one has sums of other independent random variables, yet it is unclear 
a priori if they behave close to Gaussian random variables. 

We clarify the dichotomy between orthonormal functions on the unit interval (multi-carrier 
systems in communications) and shifted kernels on the real line (single-carrier systems). In the 
former, the expected peak value behaves like y/n log n as long as the individual distributions 
satisfy a third moment condition. In the latter case, the behavior depends on the individual 




(2) 
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distributions. The linear combination of ±l's does not behave like Gaussian random variables 
and, in particular, the expected value of the supremum is significantly smaller in the ±1 case. 
The behavior in the Gaussian case follows from well-known theorems due to Slepian and Su- 
dakov, and so the contribution here is the ±1 case. 

Recall the definition of S n from equation (OQ). Our main theorem is the following. 

Theorem 2.1 Assume the random variables {a k } are independent and distributed according 
to jV(0, a 2 ). Then there exists a constant C\ > independent of a such that 



Cio^logn < E 5 n < q 1 cry / log 



n 



for all large n. If{dk}kLi have symmetric distribution and there exist constants M and m such 
that P(|afc| > M) = and E|a^| > m > for all k, then there exists a constant c 2 > 
independent of M and m such that 

c 2 m log log n < E S n < M log log n 

for all large n. 

Before turning to the proof, we briefly highlight how the result is tied to the non-integrability 
of the sine kernel. If a kernel is unbounded, then a linear combination of shifts of the kernel 
will generally be unbounded, and so we may consider only bounded kernels. If the kernel s is 
bounded and integrable, then one has 

sup y2\s(k+t)\ < c. 

Therefore, 

n 

sup > a k s(t — k) < max \a k \ sup > \s(t — k)\ 

1 - k ~ n *e[o,i) ^ 

< C max |ai.|. 

l<k<n 

Therefore, if the random variables {afc}^ =1 are uniformly bounded, a linear combination of the 
form Ylk=i a k s (t ~ k) ls a ^ so uniformly bounded. Thus, the statement in Theorem 12. II is a 
consequence, as one expects, of the non-integrability of the sine kernel. 



3 Proof of Main Result 

As commented earlier, the Gaussian case follows from theorems of Slepian and Sudakov. For 
the ±1 case, we first reduce the problem to determining the expected maximum over a finite 
set. Working with this finite set will be the majority of the paper. We first prove a proposition 
that is unencumbered by several details that are necessary for the full proof. We do this to em- 
phasize the aspect of the proof that is most important, namely the statement of the proposition. 
Additionally, we think that the proposition could quite likely be useful elsewhere. The proof 
of the main theorem then brings the original problem statement to the form addressed by the 
proposition. 
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Proposition 3.1 Assume {a^}^ are independent, symmetric and satisfy P( | > M) = 
for some M < oo and E\a k \ > m > 0/or a// k. For 1 < k < n let 

k _^ n—k+1 

X k = ^ -jdk-i+i + ^ -jdi+k-i- 

1=1 1=2 

Then there exists a constant c > independent of M and m such that for all large n 
cm log logn < E( max \X k \ ) < c _1 Mloglogn. 

V l<fc<n / 

Proof We assume that ra is large enough for several simple inequalities to hold. For 1 < k < n 
we set 



1 

Y k = ^2 J a k-i+i (3) 



and 



We have 



i=i 

n-k+l 



Z k = ^ \ a l+k-l- (4) 



I 

=2 



E[ max \X k \ ) < E( max |Y"J ) +E( max 

V Kk<n J \ Kk<n J \ Kk<n 



Since the {afc}^ 1 are not required to be identically distributed, maxx< fc < n \Y k \ and max^^ \Z k \ 
are not necessarily identically distributed. However, the same technique can be used to bound 
the expectation of both terms, and so we give the argument for the Y term and then apply it to 
both. 

Now let L be a number 2 < L < n to be chosen later. If k < L, then 

k 

\Y k \ <Mj2j <M + M\og{k) < M(l + logL). (5) 

i=i 1 



If k > L define 

k 



Y k - 2^/ J a k-l+l- 



I 

l=L+l 



(6) 



Therefore 



\Y k 


+ 


max 






L<k<n 


\Y k \ 


+ 


max 






L<k<n 



1=1 



< 2M(1 + logL) +E( max \Y^ L) \). 

\ L<k<n J 
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Using YIUl+i F — T> Hoeffding's inequality, Theorem 2 in [|4]|, gh 



l=L+l 



. gives 

> t) < 2e' Lt2 ' 2M2 

for every £ > and for every k — L + 1, . . . , n. Then, for any 5 > 

= / P( max |Y, (L) | > tjtft 

\L<k<n K J 



E( max 

V L<k<n 



Y, 



< 6 



P( max > t)dt 



< S + 2nj e- Lt2 ' 2M2 dt 



s nV2nM c _ S 2 L/2M 2 



Setting 5 = M^jp, we have 



Therefore 

El max \Yk 

\ l<k<n 

Setting L = log n, we have 



< 



2M(l + logL) + M^^ + M^£. 



El 



(7) 
(8) 
(9) 



l( max iFfeh < 2M(2 + log log n) + M ^=L , 
Vi<fe<n' 7 v/logra 

and for n large enough, applying the same argument to Z k , 

E( max \X k \ ) < 4Mloglogn. 

V l<fc<n 7 

Now we prove the lower bound. Let {ek}kLi be independent symmetric ±1 random vari- 
ables, so that {e^a^l}^ has the same distribution as {ofc}fcLi- Let [•] denote the integer part 
of a positive real number. For a constant C\ > 0, consider the subintervals {j \c.\ log 2 n) + 
1, . . . , (j + 1) [ci log 2 n] } for j = 0, . . . , [n/[ci log 2 n]] — 1. Let J denote the number of such 
intervals, i.e. J = [n/[ci log 2 n]]. In [3] it is shown in equation (2.14) that there exist constants 
< ci, C2, C3 such that with probability at least 1 — e~ nC3 , the sum of the r.v. {tj} corresponding 
to at least one of the subintervals just described of length [c\ log 2 n] is at least c 2 log 2 n. 
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For completeness we sketch the argument given in [0. Setting 7 = and K = c\ log 2 n, 

P(number of +l's in each interval is less than ja log 2 n) 
< (P(number of +1 's in one interval is less than jci log 2 n)) 1 <~ 1 
= (1 — P(number of +l's in one interval is greater than 7C1 log 2 n))^" 1 

'IC 



1-2- J] 

7 E'</<i : s: 



< (l-A 1 X- 1 /2 2 ^(^)-i))f-i, (10) 

where in (flOl) we have used Stirling's formula for | < 7 < 1, where for < x < 1, h is the 
entropy = a;log 2 (-) + (1 — 2) log 2 (Y^). By choosing c 2 small enough, /i(^jp-) — 1 = 
=1 + Ma < for some < 5 2 < ± Then 

ci ci z 2 

(1- Ai(ci logsn)- 1 /^" 110 ^^ 1 ^)- 1 ))^^- 1 
= (1 - A 1 (c 1 \og 2 n)- 1/2 2- ( ^ 1 - 252)lo ^ n )^^~ 1 
= (1 - ^(cilogan)- 1 / 2 ™^ 1 - 252 ')^^- 1 

< (1 — (ci \og 2 n)n~( 1 ~ 5 '^) c i 1 °z2 n ~ 1 

< Ce~ nS2 (1 - ( Cl log 2 n)^ 1 -^)- 1 



for n large enough. This proves the claim made above. 

We now consider the random variables X k . for kj = [[c\ log 2 n)/2] + j[cxlog 2 n), j = 
0, . . . , [n/[ci log 2 n]} — 1 and bound the size of each weighted sum of the {a^} outside the 
interval containing kj. That is, using Hoeffding's inequality again, for a given kj, 



P 



1 1 



>t\ (11) 



«=[cilog 2 n/2]+l Z=[cilog 2 n/2] + l 

< 2exp(-[ci log 2 n] t 2 /2M 2 ). 
The probability that £ is exceeded for some kj is bounded by 

2nexp(-[ci log 2 n] t 2 /2M 2 ). 

By just setting i = AM/ y/c\ we have that each sum of the form inside (fTT|) is bounded by 
4M/ v /cT with probability at least 1 - 

Let £?o denote the event that both at least one subinterval of length [c\ log 2 n] satisfying the 
property discussed from [3 J exists and that each sum outside this interval of the form inside (fTTI) 
is bounded by AM/ ^fc[. This event occurs with probability at least 1 — - for large n. That 
is, when E occurs, there exists an interval where the number of +l's is at least ^4rM c i 1°§2 n ] 
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and the number of — l's is at most ^-^[ci log 2 n). Denote by k* the kj corresponding to the 
interval with sufficiently many +l's. Then, 



E( max \Xu\ 

Kk<n 



> E[ max \Xk-\ 

Vo<j<j-i 3 

> E[ max \Xk\ 

Vo<j<j-i 3 

2 



EMEo) 



> 



max 

o<j<j-i 



-En 



/[cilog 2 n/2] [cilog 2 n/2] 

- ( 1_ n) E E yefc'+i-ilafc'+i-il + E T e fe*-i] 
y i=i ' i=i 



a>k*-i\ 



En 



2M 



(12) 



We look at the expectation term in (fT2l) and apply Lemma [3^21 which is given below. For each 
Ci log 2 n], all the subsets of a given interval of length [ci log 2 n] with the number of 



r > 



1+C2 



+l's equalling r are equally probable. If we condition on a realization of the {|ciA:|}> then the 
|dfc| with the appropriate | factors before them correspond to the b's in the lemma. Since each 
r y i±c2 ^ j Q g 2 n j a i ower bound on the k from Lemma [3^2] is = — ^ [ci log 2 n] and 



2 i 4p[cilog 2 n] - [cilog 2 n] 



[c x log 2 n] 

This holds for any realization of the coefficients {ak}k=i- Therefore, 

2 /[eilog 2 n/2] [cilog 2 n/2] 

> (i--> 2 e E T i«*--M— i h- E ji 



2M 



> (l--)c 2 

n/ 



[cilog 2 n/2] [cilog 2 ra/2] 

Em \ -\ m 

T+ E T 



2M 



> 2^1 - ^)c 2 «i log 2 ^ 



1=1 1=1 

[ Cl log 2 n]\ 2M 



> c 4 m log log n 
for a constant c 4 for all large n. 



Lemma 3.2 Le? e G {±1} P &e uniformly distributed on 

{xe{±l} p : Xi = l}\ = k} 
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for some 1 < k < p and let bi, . . . , b p be real numbers. Then 



v 

p 



i=i ^ i=1 

Proof Let X denote the set of subsets of {1, ... ,p} of cardinality k. The number of subsets 
I G X such that i G I is equal to the number of subsets of{l, . . . , p}\{i} of cardinality k — 1. 
The cardinality of this set is (^"J = Therefore, 



*.(X» = (ff EE^-E^ 

i=i v / lex iei i$i 

y 



(EE". -EE".) 

lex iei lex i^i 
-IP p 

(%2bi\IeT: iel\-^2bi\l el: i£ /|) 

i=i i=i 



F i=l 



Using Proposition 13. 11 we can now prove the main theorem. 
Proof of Theorem 12.11 We start with the upper bounds for both the Gaussian and compact 
support cases. If t < —n, then 



E 

fc=i 



sin ir(t — k) 



7T 



(t-k) 



< max afc 

Kk<n 



2n 1 

y — 



fc=n+l 



< max lafcl, 

KKn 



and the same argument holds for t > 2n. Therefore we bound the expectation of the supremum 
over t G [—n, 2n], which will always be at least the order of maxi< fc < n \a k \. Throughout we 
use I sinxl < Ixl. We have 



sin ir(t — k) 
te[-nM i — ?r(t - fc) 



sup y^a fc - 



< max sup 

-n<K2n-l tg [ 0)1 ] 



max sup 

-n<l<2n-l te[0,l] 



n 

k=l 
n 



sin7r(Z + t — k) 
%(l + t- k) 



E 

k=l 



-1 



d-k 



shi7rt 



dk- 



7t(l + t- k) 



We use 



sup max 

te[0,l] ~n<l<2n- 



JE 



— 1) shi7rt 



fc=i 



n(l + t- k) 



< sup max flfc — 

,,, ^ i -n<K2n \ ^— ' 7T 
l*l^n ~~ fc=l 



— 1) sin7rt 



(l + t-k) 



+ sup max > 

1 ... . 1 _^<-;<-9r) ' 



^— l) z fe sin7ri 



I<|t|<I ~"<*<2n I ^ TT{l + t-k) 

For the term (fT3l) . we choose an arbitrary — n <lQ<2n and obtain 



(13) 
(14) 



, » (-l)'o- fe sin7rt 
sup > a fc — 



< max 

Kk<n 



•J^ I sin 7rt I 

a* SU P I, , I — 77 

|*|<ifci 7r l^o + *-^l 



< max |ajt| sup 

l<fc<n. |t |<i 



sin 7rt | 



+ E 



sin 7rt| 



f 1 V- 1 

< max ^ sup 1 H — > — 

i<k<n \ n ^— ' Z + £ 

\ fc=l "/ 



7r|/ + t- fc| 
i- fcj 



< max ^ 

l<Jfe<n 



< max | afc | ( 1 + 



Kk<n 



< 2 max I afc I . 

Kk<n 



n 



n V n — 1 



+ 1 + log n 



(15) 



Since this holds for each I we have bounded (1131) by (fl5l) . Now we look at (|T4l) . and use that if 
/ is differentiable on the interval [a, 6], then 

sup |/WI<max{|/(o)|,|/(6)|} + |6-o|. sup (-£/)(*) 



i£[a,6] 



We then have 



max sup 

-n<l<2n— 1 l^ui^i i ~ 
-<l*l<2 fe 



E 



1)' fc sin7r£ 



< max 

l=— n,...,2n, r— 



J ir(l + t- k) 

l) l ~ k sin7r- 



E 

l,...,n-l I 



ttfc 



E 



1 1 ^ (-iy- fc sin7rt 



Ofc" 



max sup , . 

-n<i<2n i < i t i < i n I at f—* ir{l + t — k) 

n — I I — 2 k — 1 
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For the second term we have the bound Then 



1 1 d yr-^ ( — sin7rt 
max sup — — > a k — — 

-n<K2n-l i<| t |<i Tl\ at ^ TT{1 + t — k) 



max sup 

-n<l<2n-l 1^1.1^1 Tl 

n-l r l- 2 



l _ k n 2 (I + t — k) cos nt — 7r sin %t 



7T 2 (/ + t — k) 2 



< — max la J max 



Tl l<k<n 



-n<K2n 



< 



max Ofc max sup 



n Kfc<n 



^lE^- 1 ) 

/c=i 

(n ^ n 

E^F^ + ^ 
fc=l 1 1 k=l 

(l A 1_ 



sin 7rt 



7r(/ + t - A;) 2 
smut 



-n<K2n-l i 



k\ 



+ 



nt 2 



k=l,k^l 



ix{l + t-k) 2 



< — max I Ofc I (2n + 2 log n + 2) 

n l<fc<n 

< 4 max I a/; I 

Kk<n 



(16) 



for sufficiently large n. Thus, in both the Gaussian and compact support cases we have to find 
a bound on 

A (-l^sin^ 



max 

l=— n,...,2n: r= 



E 

l,...,n-l I 



fc=l 



7r(Z + i - k) 



(17) 



We start with the Gaussian case. For / = — n, . . . , 2n, and r = 1, . . . , n — 1 set 

n . r 

sin 7T- 

k= 



6, = E 



The are Gaussian random variables, so that using inequality (3.13) in O, 

e( max |& < 3(Tv/log 3n 2 + max (E|A r | 2 ) 1/2 

V -n<l<2n,l<r<n-l ' J -n<l<2n,l<r<n-l ' 

n. 

We return to (TTSl , and collecting the terms in (IT5T ). (TT6b and (|T8b we have 

-l)'- fe sin7r(t - k) 



(18) 



E[ max 

. te[— n,2n] 



n ^ 

E Gfc - 

k=l 



(t-k) 



< 6E( max la^l ) + Cicr-v/log 3n 2 

\l<k<n J 



n. 



where we have applied inequality (3.13) in to E(maxx< fc < n |afe|) as well. 

Now we address the case when the {a fc }^ =1 are symmetrically distributed and satisfy P(|a fc | > 
M) = for all k. First we replace (fTTT) for an expression with the same distribtuion: 



G73 



dist 



max 

£=— n,...,2n; r= 



IE 

l,...,n-l I f— 
10 



sin 7T- 



X 1 n 1 



For r satisfying - < |, we have 



max 

l=—n,...,2n 



Sin7T- 

n 



k=l 11 1 



< M + max 

l=— n,...,2n 



E 

k=l,k^l 



a k 



and if | < - we remove the term indexed by k = I + 1 to obtain a similar expression. We then 
have 

l n-l+l 



E 



a-k 



l+ r --k 



E 

i=2 



J - 1 + r 



+ 



3=2 « 



when - < | and an anologous term when - > |. Now we need to slightly adjust the argument 
given in the proof of Proposition 13. II Similar to equations © and ©, for — n < I < 2n and 
1 < r < n — lwe define 



E 

i=2 



j — 1 + L 

J n. 



and 



n-l+l 

Ei 

i=2 



J 



for - < | and the anologous terms when - > \. We can now pursue bounds for the Y and the 
Z terms individually, as we did earlier. Just as in ©, for any L, if / < L and - < | we have 



1 

\Y l>r \ < M(l + — ) < M + M log(0 < M(l + logL). 

3=1 ^ n 

We define Y"/^ analogously to © and note that its variance is bounded by M 2 / (L + 1), so that 
the same type of exponential bound applies as in Proposition 13.11 Where we had a maximum 
over n — L random variables in © and used the union bound, we now have 3n 2 — L and again 
use a union bound, resulting in a factor 3n 2 where we had n in ©. To take care of the n 2 , we 
set 5 = M 2v ^" , rather than 5 = M ^ 2 ^ n as it was earlier. Then the same argument as was 
made in the proof of Proposition 13.11 applies here, thus giving the upper bound of C2 log log n. 

We now prove the lower bounds and start with the Gaussian case. Let a k ~ A/"(0, a 2 ) for a 
fixed a 2 for all k. Then, considering t at the integers, 



E I sup 



k=l 



sin ir(t — k) 



7r(t - k) 



> E 



E 



max 

te{l,2,...,n} 



max <2fc 

Kfc<n 



E 

k=l 



a k 



sin7r(t — k) 



7T(* - fc) 



The lower bound follows from the standard fact that the expected maximum of n independent 
Gaussian random variables with variance a 2 is of order o^/Togn. 
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Lastly, we show the lower bound for the random variables with bounded support. We 
consider t e {§, 2 + §, 4 + |, . . . , 2p=±] + §}. Then 



2' 1 2 

sin7r(t — A;) 



fc=i 



ir(t - fc) 



> 



max 



0<K[2=i] 



shi7r(2Z + | - fc) 



fc=i 



tt(2Z + I - A;) 



2 

]\2Z-fc 



max I > Ofe- | 



max > (— l) fc Ofc 



dist 



max 

o<i<p= 



fc=i 



?r(2Z 

1 



fc=l 



tt(2/ 



(19) 



where equality (fl9l) is in distribution and holds due to the assumption that the {a^}^ are 
symmetrically distributed. The lower bound on the expectation of (fl~9l) is proved analogously 
to the lower bound in Proposition l3.1[ □ 



We close with a question. In the communications setting discussed in the Introduction, one 
is interested in methods to reduce the peak value of a signal. While the results presented here 
show that peak value grows more mildly for shifted sine kernels than for orthonormal functions 
on the unit interval, how the peak can be reduced and the limits to certain methods for doing so 
are still of interest. One such method is to allow a subset of the coefficients to be random, and 
choose the remaining coefficients to reduce the peak of the signal resulting from the random 
coefficients. Limiting behavior for this scheme in the Fourier setting on the unit interval was 
addressed in [1J. However, it is not readily apparent what the analogous behavior for shifted 
sine kernels is. 
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